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The invention relates to coding a set of input values into a set of coefficients 
by use of a given algorithm. This algorithm may be a Discrete Cosine Transformation (DCT), 
which algorithm is widely used in the field of image and video coding. 

Pao and Sun [5] disclose that digital video coding standards such as H.263 and 
MPEG are becoming more and more important for multimedia applications. Due to the huge 
amount of computations required, there are significant efforts to speed up the processing of 
video encoders. Previously, the efforts were mainly focused on the fast motion-estimation 
algorithm. However, as the motion-estimation algorithm becomes optimized, to speed up the 
video encoders further other functions such as discrete cosine transform (DCT) and inverse 
DCT (IDCT) need be optimized. Pao and Sun propose a theoretical model for DCT 
coefficients. Based on this model, it is shown that the variances of the DCT coefFicients can 
be represented as a function of the minimum mean absolute error (MMAE) after motion- 
compensated prediction. An adaptive method with multiple thresholds is derived from the 
statistical model to reduce the computations of DCT, IDCT, quantization and inverse- 
quantization. Pao and Sun further present a DCT approximation algorithm that can further 
speed up the calculations of DCT when the quantization step is large. An improvement in the 
processing speed can be achieved with negligible video-quality degradation. 

An object of the invention is to support scalability of a given algorithm. To 
this end, the invention provides a method and device for coding a set of input values into a 
set of coefficients, a method and device for inverse transforming, a video system, a signal, a 
storage medium, a method and device for determining a calculation cost of a given algorithm, 
a database, and a computer program as defined in the independent claims. Advantageous 
embodiments are defined in the dependent claims. 

Scalability means, inter alia, that quality can be exchanged v^dth algorithm 
complexity or computational power: a loss of quality can be excepted in exchange for a 
reduction in algorithm complexity or computational power, vice versa. 
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A first embodiment of the invention provides coding a set of input values into 
a set of coefficients by use of a given algorithm, the method comprising: selecting 
coefficients to be calculated, out of a total set of possible coefficients that can be calculated 
by the given algorithm given the set of input values, in which selecting higher priority is 
given to coefficients which require a lower calculation cost compared to other coefficients, 
and calculating the selected coefficients to obtain the set of coefficients. By selecting the 
coefficients which require a lower calculation cost, a higher number of coefficients is 
calculated given a limited number of calculation steps or a limited time period. The number 
of calculated coefficients is related to the quality. 

The invention is especially advantageous for algorithms that transform input 
values in a first domain (e.g. a temporal or spatial domain) into coefficients in a second 
domain (e.g. a frequency domain). A coefficient in the second domain may contain 
information on all values in the first domain, but only at a given level other than other 
coefficients. In this case, if more coefficients are available, a more accurate representation of 
the values in the first domain can be given. The coding is advantageously a video coding, 
wherein the input values form a block of pixel values, and the coefficients are transform 
coefficients selected out of a block of possible transform coefficients. 

In an advantageous embodiment of the invention, for a given coefficient the 
calculation cost is at least partly based on an amount of calculation steps that is required to 
calculate the given coefficient reduced with an amount of calculations that can be shared with 
the calculation of other selected coefficients, and wherein in the step of calculating results of 
shared calculation steps are re-used in calculating other coefficients which share the shared 
calculation steps. By selecting those coefficients that require a lower calculation cost while 
taking into account the number of calculation steps that can be shared leads to a more optimal 
selection. Given limited resources, more coefficients can be calculated in this way. In a 
practical embodiment, in the calculation of the selected coefficients, intermediate results of 
shared calculation steps are stored in a memory and retrieved for re-use in calculating other 
coefficients when necessary. 

In the selecting step, the number of coefficients to be calculated can be 
maximized given a maximum total calculation cost. In this embodiment a maximum quality 
is reached given the limited computational power. In this embodiment, the order of 
computation after selection may be arbitrary. Alternatively, given a desired number of 
coefficients to be calculated, the minimal required calculation cost can be determined. This 
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may be useful in allocating calculation resources to the given algorithm relative to other 
algorithms or applications. 

According to an advantageous embodiment, in addition to already calculated 
coefficients, a repeated selection of a next coefficient is performed until a stop criterion is 
met, for which next coefficient the calculation cost is minimal compared to other possible 
coefficients which are not yet calculated. In this embodiment 'on-the-fly' computation is 
possible, wherein the calculation is stopped when a computation limit or a certain time period 
has been reached. The algorithm can be reprogrammed to process the calculation steps in this 
specific order until a (time) limit is reached. Within this (time) limit, results can be updated 
from time to time. The algorithm is now independent of the computer system used, which can 
have an arbitrary computation power. The algorithm will calculate as many coefficients as 
possible within the given (time) Hmit and possible other constraints. Also in this embodiment 
the, the calculation cost is preferably at least partly based on the amount of calculation steps 
required to calculate the next coefficient reduced with an amoxmt of calculation steps that are 
shared between the calculating of the next coefficient and calculation steps already performed 
for already calculated coefficients. 

The invention is advantageously applied in a programmable video 
architecture. In this embodiment, a scalable (MPEG) coding algorithm is provided that 
features scalable video quality with respect to available computational power, which power 
may depend on the desired application. Given a limited computational power, this 
embodiment still preserves the quality as good as possible. One of the time-consuming basic 
algorithms of video processing applications is the calculation of the Discrete Cosine 
Transformation (DCT), but the inventions is also applicable to other algorithms. In the case 
of a transform algorithm, at a given computational limit, a maximum number of transform 
coefficients is calculated within the given calculation limit. 

In a preferred embodiment of the invention, a scan order is used which is at 
least partly determined by which coefficients are calculated. Such a scan order may be 
transmitted to the decoder, e.g. per frame. This allows adapting the scan order per frame, 
which is advantageous in encoder processing and m bit-rate. The specific scan order is 
transmitted per frame and is therefore present in the transmitted signal. If all calculated 
coefficients are present in the transmitted signal, an End Of Block (BOB) may be inserted in 
the transmitted signal to indicate that for the given block no more coefficients are 
transmitted. 
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In an alternative embodiment of the invention, a predetermined scan order is 
used such as the zig-zag scan or alternatively the alternate scan both defined in MPEG, 
wherein a predetermined value is put in the resulting bit-stream for tiie non-calculated 
transform coefficients. This predetermined value is zero in a practical embodiment. The 
signal according to this embodiment of the invention will therefore have a specific pattern of 
zeros depending on the amoimt of transform coefficients that could have been calculated 
given limited computational power. In the case of low bit-rate, a lot of zeros is non-optimal. 
Also in this embodiment, an MPEG compliant decoder can decode the transmitted signal. 
Because a specific selection of possible transform coefficients is calculated, the result of this 
embodiment of the invention is discemable in the transmitted signal. 

Favorable computation and/or scan orders may be determined off-line for a 
given transform algorithm, which favorable order is stored in a database (e.g. a look-up-table) 
in the encoder. The computational order need not to be the same as the scan order, but to save 
memory it is preferred that they are similar. In the case a non-standard scan order is used, an 
indication of which scan order has been used should be inserted. However, when the same 
database is also stored in the decoder, which is preferred, it is not necessary to transmit the 
order of the coefficients or the database/look-up-table to the decoder. In this case an index 
stiffices which indicates which scan order out of a set of scan orders has been used in the 
encoder. In the case only one predetermined scan order is used, the scan order need not to be 
transmitted. 

In the encoder, based on the available calculated transform coefficients, a scan 
order of the coefficients may be determined which is the most favorable for use in a decoder. 
Depending on how many coefficients can be buffered in the decoder, it is advantageous to 
transmit the transform coefficients in an order approximately similar to the most efficient 
computation order in the decoder. The decoder advantageously decodes the coefficients on 
the fly individually or per group of coefficients in the order as present in the transmitted 
signal. 

Advantageously, at least one additional criterion is used in selecting the 
transform values to be calculated. Because some coefficients are more important for picture 
quality than others, priority setting between coefficients is usefijl. The priority can for 
example be set by multiplying the calculation cost in the database by a priority function of 
any sort, or by sorting the coefficients into different priority groups that give a process order 
per group. Depending on different types of image blocks, different priority levels can be 
chosen for the algorithm output, to find input-dependent calculation styles. 
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Preferably, one priority criterion might be based on how often the coefficient 
value is zero (after quantization). Coefficients that are often zero should get a lower priority. 
In a decoder, adapting a computation order of the coefficients depends on the coefficients 
received and how many of these coefficients can be buffered. 

An inverse transformation operation may in the context of this invention also 
be construed as a transformation operation. In this case, the input values are formed by the 
coefBcients and a selection is made between possible output values, e.g. pixel values. Non- 
calculated pixel values may be filled in by a predetermined value or a may be derived from 
surroxmding pixel values, e.g. by averaging. Alternatively, a selection is made out of the 
coefficients which are input to the algorithm to calculate the output values. Also in this case a 
calculation cost is minimized, not by selecting which of the output values to calculate, but by 
selecting which of the available/received transform values are used as input to the algorithm 
to calculate the pixel values. If not all available transform values can be used due to the 
limitation in calculation steps that can be performed, the output values will be less accurate, 
but in the case of an image still a value is obtained for any pixel of the image (block). 

The invention further relates to a video system comprising at least an encoding 
device according to an embodiment of the present invention, and a decoding device. An 
example of such a video system is a closed system for digitally storing video material on a 
Hard Disc Drive (HDD). Other examples are video conferencing systems, digital hand-held 
cameras, etc. In the case the video material is analog the video system additionally comprises 
an analog to digital converter. If the encoder in this video system produces an MPEG 
compliant bit-stream a standard decoder may be used. Advantageously, the decoder in the 
video system is a decoder according to an embodiment of the present invention. 

The invention further relates to a method of analyzing a calculation cost of an 
algorithm. The analysis returns a database of a calculation cost as a function of coefficients. 
With this database, a list of coefficients is deductible which provides information on which 
coefficients can be calculated within a given calculation limit. Such a database can be used in 
(de-)coding according to embodiments of the present invention. 

The aforementioned and other aspects of the invention will be apparent from 
and elucidated with reference to the embodiments described hereinafter. 

In the drawings: 

Fig. 1 shows the periodicity of the cosine function; 

Fig. 2 shows the zig-zag scan order as used in H.263 and MPEG; 
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Fig. 3 shows a calculation from inputs A to ou^uts B according to an 
embodiment of the invention; 

Fig. 4 shows a calculation order of coefficients in a DCT matrix according to 
an embodiment of the invention; 
5 Fig. 5 shows a calculation order of coefiRcients in a DCT matrix according to 

an embodiment of the invention which takes into accovmt an additional priority for the upper 
left comer of the matrix; and 

Fig. 6 shows a video system according to an embodiment of the invention. 

The drawings only show those elements that are necessary to xmderstand the 

|J,0 invention. 

C5 
O 

# For a better understanding of the invention, some basic theory on the DCT 

transformation is given first. The DCT transforms the luminance and chrominance values of 

.^'^ small square blocks of an image to the transform domain. Afterwards, all coefficients are 

A 5 quantized, and the signal concentration into a small amoimt of coefficients ensures that the 

|X whole image can be saved with less data than the original. 

2 For a given image block, represented as a 2D data matrix {x[/,7];/,y=0,l,...,iV-l}, the 2D 

P DCT matrix {x[i,j];i,j=OX...,N-l} is given by 



OA 1 2 . , ^ , r .1 (2i+l)m*7r (2j+l)n*7r 

20 X\m,n\=—*u(m)*u(n)*y > xlj,7|*cos-5^ ^ *cos^ — ^ (1) 

where 



I 1 Otherwise 



To reduce the complexity of Equation (1), the row-column method is often 
used. With this method, each row and column of an image block is transformed separately by 
a ID-DCT. For a given ID data vector {x\i];i =OX...,N-l}, the ID-DCT vector 
{X\i];i =0,l,....iV-l} is defined by 



30 
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Both Equations (1) and (2) have the form of: 

Output =Const * InputMatrix * CosMatrix (3) 

The constant part of Eqmtion (3) can be merged into a later quantization step 
where transformed coefficients are removed for data compression purposes. Of course, the 
input data cannot be modified. The interesting third part is the cosine matrix. 
Transformations of this matrix are based on the periodicity of the cosine function. The cosine 
function is periodic, which means that results of the function repeats every 
27r. cos(a)=cos(K*27r+a);n e Z Furthermore, the cosine function is anti-periodic over n: , 
which means that results of the function repeat every tv , but the sign changes: 
cos(a)=(-l)" *cos(«*7r+a);« e Z Fig. 1 shows the plot of the cosine function, where four 
arrows are marked that have the same absolute value. 

Most known DCT algorithms are designed for maximal video quality. 
Different strategies can be found to reduce the complexity of the DCT computation by 
mathematical transformations of Equation (1) or (2): Lee and Huang [1] reduce the 
calculation of the cosine matrix to equivalent sub-problems of a lower complexity. They 
normalize each angel a of the cosine matrix to 0 < |a| < O.Sti and therefore a 2" x 2°-DCT is 
reduced to 2"'' x 2""*-DCT's of lower complexity. Cho and Lee [2] found data dependencies 
between two cosine matrixes given in Equation (1) to represent one of the matrixes as 
function of the other matrix. Therefore, the 2D-transformation has been reduced to a ID- 
transform, where the selection of the ID-DCT algorithm is free of choice. Arai, Agui and 
Nakajima [3] deduce the DCT from a Discrete Fourier Transform (DFT), where several 
multiplication's can be absorbed in later quantization step. 

Further, algorithms are known which reduce the computation complexity of 
the DCT to speed up calculation time, whereby a loss of video quality is accepted: Merhav 
and Vasudev [4] developed a calculation scheme for DCT and inverse DCT (IDCT). The 
main idea is to exchange all multiplications with shift operations and compensate the 
resulting error as good as possible in a later quantization step with no additional cost. Pao and 
Sun [5] made statistical analysis of encoding different video sequences with the video coding 
standard H.263. This coding standard saves an image block after the calculation of the DCT 
in a zigzag order as shown in Fig. 2, xmtil all non-zero values have been saved. The 
remaining zeros are replaced by an end-of-block (BOB) sign. From the analysis, variances of 
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the DCT coefficients can be represented as a function of the minimum mean absolute error 
(MMAE), which is taken after a motion-compensated prediction. Depending on this function 
and the quantization parameter of video coding standard H.263, thresholds have been 
measured to process an image block in different ways. Either the DCT is calculated for all 64 
coefficients, or for an approximate 4x4 low frequency DCT, or for the upper left coefficient 
(the value only, or the DCT is not performed at all. 

In the following, an embodiment of the invention is described wherein a 
specific computation order of the DCT coefficients is used depending on the DCT algorithm. 
After a computation step, the list of remaining coefficients is sorted such that m the next step 
the coefficient is computed having the lowest computation cost. In this case, the computation 
order gives a design rule for the DCT algorithm to maximize the number of coefficients 
within the given reduced computation power. Although this section concentrates on 
calculating a DCT, the matter described is also applicable to other algorithms, like the 
Inverse Discrete Cosine Transform (IDCT). 

The approaches by Merhav and Vasudev [4] and Pao and Sun [5] already 
accept of loss of quality for saving calculations. However, both approaches do not consider 
the basic DCT algorithm to take into account calculations that are shared in calculating 
respective transform coefficients. 

The knowledge of the basic DCT algorithm is important to find the best 
strategy for scaling it to lower video quality within given calculation effort and/ or time 
constraints. As a result, a specific algorithm is modified by eliminating several calculations 
and thus coefficients. The results of the algorithm then will have the best quality possible 
within the given constraints, because as many coefficients are calculated as possible. It is 
important to find out what calculations can be eliminated to keep a maximum of coefficients 
for the best possible video quality. Because the DCT algorithms process video data m 
different ways, the algorithm used for a certain application should be analyzed closely. 

The DCT algorithm is analyzed to find out the number of calculations, which 
are needed to obtain specific DCT coefficients. This analysis explores data dependencies 
between calculation nodes within the algorithm. A database can be build for every calculation 
step, when going from the input values to the finally ti-ansformed coefficient and what 
calculations are still needed to obtain another coefficient. If a computation limit is set, it is 
preferable to calculate coefficients that share calculation steps. The number of coefficients is 
then maximized with minimum effort. 
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The analysis step and the advantage of this method is explained with an 
example of a short calculation given in Fig. 3. This example shows a calculation with three 
intermediate results ti, t2 and t^. The calculation cost for coefficients Bi, B2 and B3 are 
determined by counting all operations that are needed to calculate each of the coefficients 
starting from the input values. For example, Bi is calculated by Bl = ti * Ci = (Ai + A2) * Ci 
and therefore consists of one addition (within ti) and one multiplication. This information is 
stored in a database as given in Table 1, where one multiplication is set to be equivalent to 
three additions as an example. 



overall calculations 




ti 


t2 


t3 




B, 


B2 


B3 


additions 




1 


1 


1 




ti+0=l 


ti+t2+l=3 


t3+0=l 


multiplications 




0 


0 


0 




ti+l=l 


ti+t2+0=0 


t3+l = l 


operations count 




1 


1 


1 




1+1*3=4 


3+0*3-3 


1+1*3=4 



Table 1 : Calculation cost according to an embodiment of the invention. One addition is 
counted as one operation, one multiplication is counted as three operations. 



Using this database, we can focus on finding the next DCT coefficient that 
needs the least operations, depending on the calculations already done. This will give an 
algorithm-dependent calculation order of the coefficients. In the example given in Fig. 3, B2 
will be calculated in a first step, because it only needs three operations. Coefficients Bi and 
B3 have the same calculation cost, so there seems no difference whether to calculate Bi or B3 
first. However, coefficients Bi and B2 share node ti, which leads to less remaining calculation 
cost for Bl than B3 in the second step. This can be seen in Table 2, where the database of 
Table 1 has been updated by the information, that B2 has been calculated. 



remaining calculations 




ti 


t2 


t3 




Bl 


B2 


B3 


additions 




0 


0 


1 




0 


0 


1 


multiplications 




0 


0 


0 




1 


0 


1 


operations count 




0 


0 


1 




3 


0 


4 



Table 2: Remaining calculation cost after B2 has been calculated. 



Therefore, it is preferable to calculate the given coefficients in this order: B2, 
Bl, B3. If the computation power is reduced to six operations for this example, coefficients B2 
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and Bi can be calculated. With a calculation order of B2, B3, Bl, only B2 woidd be calculated, 
because the first two coefficients B2 and B3 need seven operations together. 

The approach explained in this section has been used to find a calculation 
order for the 2D-DCT algorithm by Cho and Lee [2] including the ID-DCT algorithm by 
Arai, Agui and Nakajima [3]. The result is shown in Fig. 4. 

The calculation order can be improved, if a quantization step after the 
calculation of the DCT is considered. In most cases, the important values of a transformed 
image block can be found in the upper left comer of the block. The quantization step removes 
less important values for data compression purposes. Therefore, the coefficients can be 
combined with a priority function to prefer coefficients in the upper left comer. The 
calculation order given in Figure 5 was found by multiplymg the number of operations for 
coefficient C[ij] (stored in the generated database) with a priority function p(ij)=i*2+|i-j|+l. 
Function p was found by some experiments and seems to be suitable for a first 
implementation. 

Table 3 shows how this variation leads to another calculation order. Here, one 
multiplication is set to be equivalent to three additions and the first two coefficients Coo and 
C44 have already been calculated. It is clear that the next coefficient to be calculated is C04 
without using a priority fimction, but C22 when using priority fimction p. 





Co4 


C22 


additions left to calculate coefficient 


7 


9 


multiplications left to calculate coefficient 


4 


4 


operations count - without priority fimction 


19 


22 


priority function p(i,j) 


13 


9 


operations count - scaled with priority function 


247 


189 



Table 3: Decision of next coefficient to be calculated. C04 is preferred without using a priority 
function, C22 when using priority function p. 

A further enhancement is that the calculation order can be optimized with a 
priority function, which is designed for certain contents of an image block. For example, 
image blocks are categorized in three different groups: image blocks containing horizontal 
lines, vertical lines or blocks without a clear structure. In each of these three groups, the DCT 
will prefer specific coefficients to describe the original image block. This can be expressed 
with a priority function. A short pre-analysis of each image block can be performed or taken 
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from other functions that do similar analysis, to ensure that the most important coefficients 
are calculated first. 

Within the MPEG standard, a zigzag order as shown in Fig. 2 is used to code 
DCT coefficients, because the most important values are normally found in the upper left 
5 comer of the quantized block. Using this zigzag order as a calculation order, many time- 
consuming calculations have to be done at the beginning of the computation to obtain the 
first coefficients, because these values depend on different inputs and no intermediate result 
can be reused. For a reduced computation power, this would result in fewer coefficients to be 
used afterwards. Thus finding the best computation order is usefiil. 
1 0 The operations count for a given number of coefficients of the zigzag order 

H' has been compared with the calculation-optimized order presented in this section. It can be 

y 

O noticed that the calculation-optimized order leads to significantly more coefficients 
m calculated, which results in a better video quality. The SNR improves between 1-5 dB. 

The method presented is practical for scalable algorithms in many ways. 
Si 5 Instead of presenting a specific quantity of coefficients to be calculated, it can be used for 
Q automatic quality scaling. For example, running a real-time video application on a PC with 
P low computation power may fail, because this PC is not able to complete all calculations in 
real-time. In this case, the video processing will be aborted or show hick-ups. To solve this 
E problem, the video processing software can update a list of already calculated coefficients, 
20 until the next block has to be processed or a user-defined time limit is reached. With this 
solution, full screen and fiiU temporal viewable video can be ensured. 

This embodiment of the invention provides an advantageous method for 
computing the DCT in a special order to support scalability. This is achieved by analyzing 
each calculation step of a DCT algorithm to find coefficients that should computed next with 
25 minimum effort. The method maximizes the SNR of the picture during the computation by 
obtaining a high amount of DCT coefficients up to the point of consideration. 

The computation method can be enhanced by various features such as a 
prioritization fiinction, which favors the computation of low-frequency coefficients so that it 
fits better with MPEG coding after performing a DCT. The technique can successfiilly 
30 implemented for an IDCT as well. 

Fig. 6 shows a video system comprising a video source 1, a transmitter 2, a 
communication channel or storage medium 3, a receiver 4 and a display device 5. The video 
source 1 may be a camera or the like and fiimishes a video source signal SI to the transmitter 
2. The transmitter 2 comprises a video encoder 20. The video encoder comprises a 
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calculation unit 201, a memory 202 and an output unit 203. The calculation unit calculates 
from the input samples of the video source signal SI a set of transform coefficients that are 
included in the coded output signal S2 which is transmitted over the communication channel 
3 or alternatively stored in the case the communication channel 3 is a storage medium. The 
video encoder 20 further comprises a memory 202, which is used for storing intermediate 
results of calculations in the calculation unit 201 . The intermediate results are typically 
results from calculations that are shared between the calculations of respective transform 
coefficients that are calculated in the calculation unit 201. The memory 202 can fiirther be 
used to store a scan order or computation order of the transform coefficients. The output unit 
203 formats the transform values into a suitable format for transmission. In video encoders 
such as MPEG encoders, transform coefficients are usually quantized to reduce the number 
of bits necessary to represent the transform value. In Fig. 6, necessary quantize operations are 
assumed to be performed in the calculation unit 201. Although not shovm in Fig. 6, MPEG 
encoders usually also comprise elements for performing motion estimation and compensation 
for predictively coding pictures. The output unit 203 may perform operations like variable 
length encoding, multiplexing and channel coding. 

According to an embodunent of the invention, the computation order is 
algorithm dependent, although the computation order may additionally be determined by a 
priority function, which takes other conditions into accoxmt, as described earlier. The scan 
order may be identical to the computation order, but that is not necessary. In any case, the 
decoder should be synchronized with the encoder on the scan order. The decoder may use 
another computation order than the encoder, because for a decoding algorithm(s) another 
computation may be more efficient. 

The receiver 4 comprises a decoder 40. The video decoder 40 comprises an 
input unit 403, a calculation unit 401 and a memory 403. The input unit receives a coded 
video signal S2' from the communication chaimel or storage medium 3. The coded video 
signal S2' will normally be identical to the signal S2, although S2' may contain errors 
introduced by the communication channel or storage medium 3. The input unit 403 may 
perform operations like variable length decoding, demultiplexing and channel decoding, 
normally inversely to the operations performed in the output unit 203. The calculation unit 
401 performs an inverse transformation to calculate pixel values fi-om the received transform 
coefficients. The pixel values are included in an output signal SI ' which is a reduced quality 
version of the video source signal SI. The output signal SI ' is displayed on the display unit 
5. 
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The decoder 40 may be a standard decoder. Advantageously, the decoder 40 is 
a decoder according to an embodiment of the invention. As already explained, a selection 
may be made between the available transform coefficients which are input to the inverse 
transform, in which selection higher priority is given to transform coefficients which require 
5 a lower calculation cost than other coefficients, also based on the amount of calculation steps 
required for the selected transform coefficients and the amount of calculation steps that can 
be shared. For this purpose, the memory 402 may contain a database which indicates which 
of the available transform coefficients may be calculated given a maximum computation 
power. In a further embodiment, the memory 402 stores a scan order used by an encoder 
1 0 according to an embodiment of the invention, which scan order is determined by which 
coefficients are calculated or which scan order is even approximately similar to the 
O computation order in the encoder. 
* ^ ■ ■ • 

fij. The mvention is advantageously applied in applications that need real-time 

"1; video encoding on one hand, but have further restrictions on the other hand, such as: 
^1 5 Video conferencing systems which have a low video resolution and often 

J==. communicate the video stream via a narrow-bandwidth connection. This leads to 

H' communication delays between the conference participants, which delay has to be 

M 

^ mmimized. Furthermore, video conferencing is an example where video with sufficient 

O 

1 temporal resolution is more important than high spatial video quality. 

20 Digital hand-held video cameras which should be handy, cheap and of good 

quality to be accepted by the consumer. These cameras have a medium resolution and 

therefore need more complex video processing algorithms than video conferencing systems. 

To limit the cost of a camera, these algorithms should be programmable in software or should 

lead to simple hardware solutions. 
25 Televisions with general-purpose computation power. Part of the available 

computation power can be saved by scaling the given algorithms for video applications to 

lower complexity, therefore enabling the television to perform other tasks in parallel. 

Otherwise, the video application could block other applications of interest. 

The invention is further applicable to parametric coding schemes, wherein 
30 input values are coded into a set of parameters. In the claims, coefficients should be 

construed as parameters in these coding schemes. 

It should be noted that the above-mentioned embodiments illustrate rather than 

limit the invention, and that those skilled in the art will be able to design many alternative 

embodiments without departing from the scope of the appended claims. In the claims, any 
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reference signs placed between parentheses shall not be construed as limiting the claim. The 
word 'comprising' does not exclude the presence of other elements or steps than those listed 
in a claim. The invention can be implemented by means of hardware comprising several 
distinct elements, and by means of a suitably programmed computer. In a device claim 
enumerating several means, several of these means can be embodied by one and the same 
item of hardware. The mere fact that certain measures are recited in mutually different 
dependent claims does not indicate that a combination of these measures cannot be used to 
advantage. 
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